Reviews: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning
Originality: The use of graph neural networks appears novel (concurrent with Paliwal), as does the sweep order, for which I don't know of other papers, at least for this application of graph neural networks. The trick of using architecture search as a dataset also seems novel, and I'm quite happy with this idea.

Quality: The submission is sound, but I have a few minor concerns: 1. It's possible REINFORCE is good enough, but I'm skeptical given that (1) REINFORCE is much worse in normal RL environments and (2) the paper explicitly presents evidence that using an incremental baseline helps learning. The learned value function in PPO, Q-learning, etc. could potentially play the same variance-reduction role, or even do quite a lot better (presumably not all of the variance due to upstream moves is explained by the reward so far).
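The reviewer's variance-reduction point can be illustrated with a toy REINFORCE gradient estimate. The sketch below is not from the paper; the reward distribution, score values, and episode count are all made-up numbers chosen only to show why subtracting a baseline shrinks the variance of the per-episode gradient terms without biasing them:

```python
import numpy as np

# Toy illustration: a baseline reduces the variance of the REINFORCE
# gradient estimate. All numbers here are hypothetical.
rng = np.random.default_rng(0)
n_episodes = 5000

# Per-episode score (grad of log-prob, collapsed to a scalar) and a reward
# that is large on average but only slightly noisy.
grad_logp = rng.normal(size=n_episodes)
rewards = 10.0 + 0.1 * rng.normal(size=n_episodes)

# REINFORCE gradient terms: (R - b) * grad log pi, with b = 0 vs b = mean(R).
terms_no_baseline = rewards * grad_logp
terms_with_baseline = (rewards - rewards.mean()) * grad_logp

var_no_baseline = terms_no_baseline.var()
var_with_baseline = terms_with_baseline.var()
print(var_no_baseline, var_with_baseline)
```

Both estimators have the same expectation (the baseline is independent of the action), but the large constant component of the reward inflates the variance of the baseline-free estimate by orders of magnitude; a learned value function, as in PPO or actor-critic methods, plays the same role with a state-dependent baseline.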
The paper introduces a new RL-based approach to device placement in computation graphs that relies on a graph embedding neural network instead of RNNs. The reviewers were all impressed by the novelty of the proposed approach, the significance of the empirical results, and the ability of the method to generalize across different tasks. While preparing the final version, please take into account the detailed comments and suggestions mentioned in the reviews.
Learning Generalizable Device Placement Algorithms for Distributed Machine Learning
We present Placeto, a reinforcement learning (RL) approach to efficiently find device placements for distributed neural network training. Unlike prior approaches that only find a device placement for a specific computation graph, Placeto can learn generalizable device placement policies that can be applied to any graph. We propose two key ideas in our approach: (1) we represent the policy as performing iterative placement improvements, rather than outputting a placement in one shot; (2) we use graph embeddings to capture relevant information about the structure of the computation graph, without relying on node labels for indexing. These ideas allow Placeto to train efficiently and generalize to unseen graphs. Our experiments show that Placeto requires up to 6.1x fewer training steps to find placements that are on par with or better than the best placements found by prior approaches.
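The first key idea, a policy that makes iterative placement improvements rather than emitting a placement in one shot, can be sketched with a toy model. The graph, node costs, communication cost, and greedy node-update rule below are illustrative stand-ins: Placeto learns an RL policy over graph embeddings and optimizes measured runtimes, not this hand-written cost function:

```python
import numpy as np

# Hypothetical 4-op computation graph: edges and per-node compute costs.
edges = [(0, 1), (0, 2), (1, 3), (2, 3)]
compute = np.array([4.0, 2.0, 2.0, 4.0])
num_devices, comm_cost = 2, 1.0

def cost(placement):
    """Toy stand-in for measured runtime: the most-loaded device's total
    compute, plus a fixed penalty per cross-device edge."""
    loads = np.bincount(placement, weights=compute, minlength=num_devices)
    cross = sum(placement[u] != placement[v] for u, v in edges)
    return loads.max() + comm_cost * cross

def iterative_improve(placement, sweeps=3):
    """Placeto-style MDP: visit one node per step and (re)assign its device.
    A greedy rule replaces the learned RL policy here, for illustration."""
    placement = placement.copy()
    for _ in range(sweeps):
        for v in range(len(compute)):
            candidates = []
            for d in range(num_devices):
                trial = placement.copy()
                trial[v] = d
                candidates.append((cost(trial), d))
            placement[v] = min(candidates)[1]
    return placement

start = np.zeros(len(compute), dtype=int)   # all ops on device 0
final = iterative_improve(start)
print(cost(start), "->", cost(final), final)
```

Each node visit is one MDP step whose state is the graph plus the current placement; in Placeto the action distribution at each step is computed from graph embeddings, so the same trained policy transfers to unseen graphs.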
Ravichandra Addanki, Shaileshh Bojja Venkatakrishnan, Shreyan Gupta, Hongzi Mao, Mohammad Alizadeh